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The Testbed for LISA Analysis (TLA) Project aims to facilitate the development, validation 
and comparison of different methods for LISA science data analysis, by the broad LISA Science 
Community, to meet the special challenges that LISA poses. It includes a well-defined Simulated 
LISA Data Product (SLDP), which provides a clean interface between the communities that have 
developed to model and to analyze the LISA science data stream; a web-based clearinghouse (at 
: //tla. gravity. psu. edu >) providing SLDP software libraries, relevant software, papers and 
other documentation, and a repository for SLDP data sets; a set of mailing lists for communication 
between and among LISA simulators and LISA science analysts; a problem tracking system for 
SLDP support; and a program of workshops to allow the burgeoning LISA science community to 
further refine the SLDP definition, define specific LISA science analysis challenges, and report their 
results. This note describes the TLA Project, the resources it provides immediately, its future plans, 
and invites the participation of the broader community in the furtherance of its goals. 
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I. INTRODUCTION 

The Laser Interferometer Space Antenna (LISA) will 
be sensitive to gravitational waves arising from the coa- 
lescence of 10'*~10^ M© black hole binaries in the centers 
of distant galaxies, the capture of stars, solar mass and 
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II. GOALS, OBJECTIVES, STRATEGIES 

The principal goal of the TLA is enable the burgeoning 
LISA science community to focus its energy on prepar- 
ing to meet the challenges posed by LISA science anal- 
ysis. A short-term objective, which advances us toward 
this goal, is to enable the science community to identify 
the key LISA science data analysis challenges and pro- 
vide proof- of -principle demonstrations of different analy- 
sis techniques that meet these key challenges. 

To achieve its goals the TLA Project has identified 
the need for and provides several tools and an enabling 
infrastructure for collaboration. These include 

• A web-based clearinghouse 
(<http://tla.gravity.psu.edu>) for sharing 
LISA data sets and related software, documenta- 
tion and publications (and which also acts as a 
portal to "all things LISA" ) ; 

• Several archived mailing lists for the discussion re- 
lated to the use of these data sets and tools; 

• A well-defined "Simulated LISA Data Product" 
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(a container for simulated LISA data that can be 
used to develop, validate and compare analysis 
methodologies), together with supported and main- 
tained software libraries that can read and write the 
SLDP; 

• A program of community open workshops and 
working meetings to further develop the SLDP, de- 
velop community-based analysis challenges, and fa- 
cilitate their execution and the broad dissemination 
of their results; and 

• A governance structure that allows the science com- 
munity to participate in the TLA at all levels, from 
setting the objectives and strategy that will move 
the TLA toward its goals, to contributing analysis 
or simulations tools, to helping to further develop 
the collaborative infrastructure. 

The following sections describe each of these components 
of the TLA. 



III. THE SIMULATED LISA DATA PRODUCT 
AND RELATED SOFTWARE 

A. Introduction 

Simulating LISA data and noise is an involved and 
complicated enterprise. There are different ways of repre- 
senting the LISA science data (e.g., different TDI formu- 
lations), different approximations for constructing sim- 
ulated LISA data (low-frequency, rigid adiabatic, etc.), 
several different LISA constellation ephemeris approxi- 
mations (circular orbit, second-order eccentricity, etc.), 
and several different LISA simulators, each providing 
their output in a different format, etc. 

Nevertheless, the basic or schematic list of necessary 
and sufficient information required to carry-out LISA 
data analysis is quite short: 

• The constellation observables: e.g., a specific set of 
TDI variables; 

• The constellation response: i.e., the relationship 
between the observables and incident gravitational 
plane waves; 

• The constellation ephemeris: i.e., the position and 
orientation of the constellation as a function of 
time; 

• A characterization of the constellation observable 
noise; and 

• Associated metadata (e.g., sampling rates for ob- 
servables, ephemeris, time- varying response). 

If these are given then the particular choices made and 
details associated with the production of simulated LISA 



data sets are largely or entirely irrelevant to the data an- 
alyst. To lower the barrier to community participation in 
the development, validation and comparison of different 
methods for LISA analysis it is thus crucial to separate 
the "production" of simulated LISA data from its "use" 
in science analysis. 

To make this separation and allow the processes of sim- 
ulated data production and analysis to be carried out in- 
dependently we have developed the Simulated LISA Data 
Product (SLDP). The SLDP is a well-defined container 
for the information, described above, that is necessary 
and sufficient for the development, validation and com- 
parison of analysis methods. To facilitate the use of the 
SLDP it is defined through an application programmer 
interface and the TLA Project provides and maintains 
implementations of that interface in the programming 
languages most commonly used for the production and 
analysis of simulated LISA data. The SLDP is the prin- 
cipal enabling technology of the TLA Project. 

In defining the interface between the simulator and the 
analyst as we have in the list above, we have made sev- 
eral implicit assumptions about the distinct responsibil- 
ities of the LISA Project and the LISA Science Analysis 
Community. In particular, we assume that when LISA 
is flying the LISA Project, and not the science analysis 
community, is responsible for initial processing of LISA 
data for the purpose of assuring the data integrity, char- 
acterizing the detector noise, identifying and removing 
instrumental artifacts, determination of the constellation 
response function, and preparing a structured data set 
that is ready for use in the scientific investigations that 
are the missions goal. While these are all important and 
challenging problems, they are distinct from the prob- 
lem of analyzing the LISA data for gravitational wave 
sources, which is the thrust of the TLA Project. (1| 

The SLDP is intended to be an evolving standard. As 
LISA'S design matures the SLDP will also evolve to re- 
main faithful to what we expect the data available for 
science analysis will involve. Additionally, through the 
use of the SLDP we expect to learn more about how a 
LISA data product that facilitates the process of science 
analysis should be structured. 

B. Implementing the SLDP 

Both the producers and users of SLDP data sets re- 
quire a clean and well-defined software interface to the 
data sets: i.e., they should not need to be concerned 
with the underlying data set storage format. A suitable 
interface should be 

• Well-defined; 

• Stable; 

• Supported on all the major hardware platforms 
used for data set production and analysis devel- 
opment; 
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• Supported in all the major programming languages 
or environments used for data set production and 
analysis development; and 

• Insulate both simulators and analysts from the un- 
derlying details of the storage format. 

To lower the barrier to participation in the development 
of simulated LISA data and the development of analysis 
methodologies the TLA Project provides, and is com- 
mitted to continuing to provide and support, such an 
interface. 

While not of primary concern to the simulators or an- 
alysts, the underlying choice of data product storage for- 
mat can have a significant impact on the success of this 
effort and the resources required to sustain it. The cen- 
tral objectives that governed the TLA's choice of data 
storage format were to minimize the barrier to participa- 
tion presented by the storage format and minimize the ef- 
fort required by the TLA Project in its development and 
maintenance. With these objectives in mind we imposed 
the following requirements on the data storage format: 

• The storage format should be flexible enough to 
handle the typed and multi-dimensional array data 
objects that will represent LISA data; 

• The storage format should be structured and self- 
documenting: i.e., it should provide a table of 
contents of stored data objects, including object 
descriptions and arbitrary user-defined object and 
data product metadata; 

• APIs (i.e., software libraries) for reading and writ- 
ing files in this format should exist for all major 
programming languages, analysis development en- 
vironments, and hardware platforms; 

• The storage format definition, implementation and 
APIs should be well-documented, controlled and 
supported; 

• Storage format implementations and APIs that are 
controlled and supported by permanent organiza- 
tions representing large and mature science com- 
munities are preferred over those that are not; 

• Storage format implementations and APIs that are 
free and freely distributable (i.e., no copyright and 
no export restrictions) are preferred over those that 
are not. 

The version 5 Hierarchical Data Format (HDF5) (cf. 
<http://hdf.ncsa.uiuc.edu/HDF5/>) meets all the re- 
quirements posed above: 

• HDF5 is a general purpose library and file format 
for storing scientiflc data. It is constructed about 
two primary objects: datasets, which are multi- 
dimensional arrays of data elements, and groups, 
which are hierarchically structured sets of datasets; 



• Datasets and groups are named and arbitrary 
metadata can be associated with each; 

• IIDF5 APIs are available for C, C-l— 1-, Fortran90, 
Mathematica, and Matlab, and the distribution is 
tested on Linux (ia64, x86_64, and x86) and Mac 
OS X; 

• The IIDF5 format and APIs are well-documented 
and designed for efficient storage and I/O in high 
performance, data intensive computing environ- 
ments; 

• IIDF5 was developed, and is maintained and sup- 
ported, by the National Center for Supercomputer 
Applications (NCSA). It has a large and active user 
community that encompasses a large range of ap- 
plications, including the packaging and distribution 
of NASA Earth Observing System datasets; 

• IIDF5 is free and freely distributable in source form 
or as pre-compiled binary distributions for all ma- 
jor computing platforms. 

SLOP data sets provided by the TLA will be in IIDF5 
format, include a TLA supported API, and implementa- 
tions of the API for use in all major programming envi- 
ronments. 



IV. COLLABORATIVE INFRASTRUCTURE: 
WEB CLEARINGHOUSE, MAILING LISTS AND 
WORKSHOP PROGRAM 

Communication and collaboration between and among 
simulators and analysts is critical for meeting the 
TLA Project goals. To support this commu- 
nication and collaboration the TLA Project in- 
cludes a publicly accessible web-based clearinghouse at 
< ]http : //tla ■ gravity . psu . edu ^ mailing lists to facil- 
itate discussion amongst simulators and analysts; and 
a program of community-open workshops to enable the 
community to collaborate on the production of simulated 
LISA data, and development, validation and comparison 
of different analysis methodologies. 

The principal purpose of the TLA web site is to act as 
a clearinghouse for the distribution, exchange and sup- 
port of SLDP data sets, the software required to read 
and write them, and any relevant documentation. In ad- 
dition. The TLA web site will provide links to (and, on 
request, act as a repository for) any and all relevant soft- 
ware tools, documentation, technical notes, and science 
publication references related to the TLA's goals. Fi- 
nally, the TLA site will also serve as a web-portal to "all 
things LISA" , providing links to other LISA sites and 
sites relevant to LISA analysis goals and challenges. 

To share information of relevance to simulators and 
analysis developers and facilitate discussion between and 
amongst the members of these communities the TLA also 
supports several mailing lists. Initially four mailing lists 
are supported: 
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• <TLA-Simulate@Gravity .PSU.Edu> is an unmod- 
erated list where questions and discussion related 
to the production of simulated LISA data can take 
place; 

• <TLA-Analyze@Gravity .PSU.Edu> is an unmoder- 
ated list where questions and discussion related to 
the development, validation and comparison of sim- 
ulated LISA data can take place; 

• <TLA-Developer@Gravity . PSU . Edu> is an unmod- 
erated list where to support discussion amongst 
those taking part in the development of TLA 
Project support software; and 

• <TLA-Announce@Gravity .PSU.Edu> is a moder- 
ated list, deliberately kept to a low volume, that 
provides announcements of broad interest to the 
broad community interested in the TLA and its 
goals 

Finally, the TLA Project will host a series of workshops 
and working meetings, intended to further the goal of en- 
abling the burgeoning LISA science community to focus 
its energy on preparing to meet the challenges posed by 
LISA science analysis. Meetings that are part of this pro- 
gram will focus on the simulation of LISA data; the devel- 
opment, validation and comparison of analysis method- 
ologies; establishing contact and encouraging collabora- 
tion between the LISA science analysis community and 
the statistics, applied mathematics, and astrostatistics 
communities; and the development and execution of anal- 
ysis challenges, focused on demonstrating the capability 
of analysis technology in meeting LISA's science goals. 

V. GETTING INVOLVED 

The goal of the TLA is to enable the burgeoning LISA 
science community to focus its energy on preparing to 
meet the challenges posed by LISA science analysis. To- 
ward that end the TLA will do everything possible to 
enable the participation of any and all members of the 
LISA science community in TLA activities. Interested 
members of the LISA Science Community can partici- 
pate by 

• Simulating LISA data, with different levels of fi- 
delity to the LISA design and expected gravita- 
tional wave sources, and providing data sets to be 
disseminated via the TLA web clearinghouse; 

• Developing, validating and comparing data analysis 
methodologies by analyzing data sets and reporting 
their results; 

• Contributing support software to enable the TLA 
Project goals; 



• Providing their expertise to the broad community 
by taking part in the mailing list discussions; 

• Taking part in workshops and working meetings de- 
voted to LISA science analysis; and 

• Joining the TLA Development Team and contribut- 
ing to the development of TLA Project Infrastruc- 
ture: API implementations in new languages; the 
design and development of subsequent versions of 
the API, SLDP, and other collaborative infrastruc- 
ture; the administration of analysis drills and the 
evaluation of their results; etc. 

The TLA web site (http : //tla . gravity . psu . edu) pro- 
vides information on how to participate in all these ways. 



VI. CONCLUSIONS 

The goal of the Testbed for LISA Analysis (TLA) 
Project is to enable and facilitate the development, val- 
idation and comparison of different methods for LISA 
science data analysis. Its target audience is the broad 
LISA Science Community in general, and the communi- 
ties of LISA science data modelers and LISA science data 
analysts in particular. To ease the exchange of informa- 
tion between these two communities, the TLA includes 
a well-defined Simulated LISA Data Product (SLDP); 
a web-based clearinghouse ( http://tla.gravity.psu.edu ;J 
to provide SLDP data sets, software libraries, and other 
relevant software, papers and documentation; a set of 
mailing lists to support communication among the sub- 
communities; and a problem tracking system for SLDP 
and related software support. In order to ensure that the 
TLA continues to grow and adapt to the needs of the 
broad scientific community, the TLA Project will sup- 
port a program of workshops to allow the burgeoning 
LISA science community to further refine the SLDP defi- 
nition, define specific LISA science challenges, and report 
their results. 
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[1] As in any experimental enterprise, tlie lower the signal 
to noise the greater the degree of instrument familiarity 
required of any science team that presumes to carry-out 
science investigations with the instrument data. Neverthe- 
less, as a practical matter there will always be a starting 
place for science investigations and this starting place will 
be a data product that is constructed, using the fullest 



awareness of the instrument's behavior, to be as clean 
of instrumental signatures or artifacts as the instrument 
team can make it. The SLDP is extensible and is designed 
with the intent that future generations will have content 
that is increasing faithful to the complexity of a final LISA 
Data Product. 



